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<210> 1 
<211> 2562 
<212> DNA 

<213> Human immunodeficiency virus type 1 



<220> 
<223> 



full length human immunodeficiency virus -1 (HIV-1) 
envelope glycoprotein (Env, gpl60) from primary 
R5X4 isolate 8 9.6 



<400> 1 

atgagagtga 

gggatgttga 

gtacctgtgt 

gatacagagg 

caagaagtag 

gatcagatgc 

ttaaccccac 

aatcccacta 

tatatcacca 

gatgtagtac 

tcagtcatta 

gtcccggctg 

tgcacaaatg 

ctgctgttaa 

gacaatgcta 

cccaacaaca 

agaaacataa 

aacactttac 

tttaatcaat 

gaatttttct 

acaaatggca 

aatatgtggc 

tgttcatcaa 

gagactgaga 

tataaatata 



aggagatcag 
tgatctgtag 
ggagagaagc 
tacataatgt 
tattgggaaa 
atgaggatat 
tctgtgttac 
gtagcagctg 
caagcataag 
caatagaaaa 
cacaggcctg 
ggtttgcgat 
tcagcacagt 
atggcagtct 
aaaccataat 
atacaagaag 
taggagatat 
aacagatagt 
cctcaggagg 
actgtaatac 
ctgaaggaaa 
agaaagtagg 
atattacagg 
tcttcagacc 
aagtagtaag 



gaagaattgg 
tgctgcaaaa 
aaccaccact 
ttgggccaca 
tgtgacagaa 
aatcagttta 
tttaaattgc 
gggaatgatg 
aaataaggta 
tactaataat 
tccaaaggta 
gctaaagtgt 
acaatgtaca 
agcagaagaa 
agtacagcta 
aaggttatct 
aagacaagca 
tataaaatta 
ggacccagaa 
agcacaactg 
tgacataatc 
aaaagcaatg 
gctgctacta 
tggaggagga 
aattgaacca 



cagcacttga 
gaaaaaacgt 
ctattttgtg 
catgcctgtg 
aattttaaca 
tgggatgaaa 
actaatttga 
gagaaaggag 
aagaaagaat 
actaagtata 
tcctttcagc 
aacaataaga 
catggaatta 
gacatagtaa 
aatgaatctg 
ataggaccag 
cattgtaaca 
agagaaaaat 
attgtaatgc 
tttaatagta 
acactccaat 
tatgcccctc 
acaagagatg 
gatatgaggg 
ataggagtag 



gagggggcat 
gggtcacaat 
catcagatgc 
tacccacaga 
tgtggaaaaa 
gcctaaagcc 
atatcactaa 
aaataaaaaa 
atgcactttt 
ggttaataag 
caattcccat 
cattcaatgg 
ggccagtggt 
ttagatctga 
tagtaattaa 
ggagagcatt 
ttagtagagc 
ttaggaataa 
acagttttaa 
cttggaatgt 
gcagaataaa 
ccatcacagg 
gaggtaatag 
acaattggag 
cacccaccag 



cttgctcctt 
ctattatggg 
taaagcctat 
ccccaaccca 
taacatggta 
atgtgtaaaa 
gaatactact 
ttgctctttc 
taatagactt 
ttgtaacacc 
acattattgt 
atcaggacca 
gtcaactcaa 
aaatttcaca 
ttgtacaaga 
ttatgcaaga 
aaaatggaat 
aacaatagcc 
ttgtggaggg 
tactggaggg 
acaaattata 
acaaattaga 
tactgagact 
aagtgaatta 
ggcaaagaga 



60 

12 0 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 



1 



agaacagtgc 
ggagcagcag 
ttattgtctg 
catatgttgc 
gaaagatacc 
tgcaccactt 
aataacatga 
gacttacttg 
gataaatggg 
ttattcataa 
atagtaaata 
tcgaggggac 
agatccggtc 
tgcctcttcc 
cttctgggac 
agccaggaac 
gaggggacag 
cctacaagaa 



aaagagaaaa 
gaagcactat 
gtatagtgca 
aactcacagt 
taagggatca 
ctgtgccttg 
cctggatgga 
aaaaatcgca 
caagtttgtg 
tgatagtagg 
gagttaggca 
ccgacaggcc 
cattagtgaa 
tctaccacct 
gcagggggtg 
taaagaatag 
atagggttat 
tcagacaggg 



aagagcagtg 
gggcgcagcg 
gcagcagaac 
ctggggcatc 
acagctcatg 
gaatgttagt 
gtgggaaaga 
aacccaacaa 
gaattggttt 
aggcttgata 
gggatattca 
cgaaggaaca 
cggattcttg 
cttgagaaac 
ggaagccctc 
tgctgttagc 
aaaaatagta 
cttggaaagg 



ggaataggag 
tcagtgacgc 
aatctgctga 
aagcagctcc 
ggaatttggg 
tggagtaata 
gaaattgaca 
gaaaagaatg 
gacataacaa 
ggtttaagaa 
ccattatcgt 
gaagaagaag 
gcacttttct 
ttactcttga 
aaatattggt 
ttgctcaacg 
caaagagctt 
gctttgctat 



ctgtgttcct 
tgacggtaca 
gggctattga 
aggcaagagt 
gttgctctgg 
aatctgtgga 
attacacaga 
aaaaagaatt 
actggctgtg 
tagtttttgc 
ttcagaccct 
gtggagagag 
gggtcgattt 
ttgtaacgag 
ggaatctcct 
ccacagccat 
gtagagctat 
aa 



tgggttcttg 
ggccaggcta 
ggcgcaacag 
cctggctctg 
aaaactcatt 
tgatatttgg 
ctatatatat 
attggaattg 
gtatataaga 
tgtactttct 
cctcccagcc 
agacagagac 
gaggaacctg 
gattgtggaa 
gcaatattgg 
agcagtagct 
tcgcaacata 



1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2562 



<210> 2 
<211> 853 
<212> PRT 

<213> Human immunodeficiency virus type 1 



<220> 

<223> full length human immunodeficiency virus-1 (HIV-1) 
envelope glycoprotein (Env, gpl60) from primary 
R5X4 isolate 89.6 



<400> 2 

Met Arg Val Lys 
1 

lie Leu Leu Leu 
20 

Thr Trp Val Thr 

35 

Thr Thr Leu Phe 
50 

His Asn Val Trp 
65 



Glu lie Arg Lys 
5 

Gly Met Leu Met 

lie Tyr Tyr Gly 
40 

Cys Ala Ser Asp 
55 

Ala Thr His Ala 
70 



Asn Trp Gin His 
10 

lie Cys Ser Ala 
25 

Val Pro Val Trp 



Ala Lys Ala Tyr 
60 

Cys Val Pro Thr 
75 



Leu Arg Gly Gly 
15 

Ala Lys Glu Lys 
30 

Arg Glu Ala Thr 
45 

Asp Thr Glu Val 



Asp Pro Asn Pro 
80 



Gin Glu Val Val Leu Gly Asn Val 
85 

Asn Asn Met Val Asp Gin Met His 
100 

Glu Ser Leu Lys Pro Cys Val Lys 
115 120 



Thr Glu Asn Phe Asn Met Trp Lys 
90 95 

Glu Asp lie lie Ser Leu Trp Asp 
105 110 

Leu Thr Pro Leu Cys Val Thr Leu 
125 



Asn Cys Thr Asn Leu Asn lie Thr Lys Asn Thr Thr Asn Pro Thr Ser 
130 135 140 

Ser Ser Trp Gly Met Met Glu Lys Gly Glu lie Lys Asn Cys Ser Phe 

145 150 155 160 



2 



Tyr lie Thr Thr Ser lie Arg Asn Lys Val Lys Lys Glu Tyr Ala Leu 
165 170 175 

Phe Asn Arg Leu Asp Val Val Pro lie Glu Asn Thr Asn Asn Thr Lys 
180 185 190 

Tyr Arg Leu lie Ser Cys Asn Thr Ser Val lie Thr Gin Ala Cys Pro 
195 200 205 

Lys Val Ser Phe Gin Pro lie Pro lie His Tyr Cys Val Pro Ala Gly 
210 215 220 

Phe Ala Met Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Ser Gly Pro 
225 230 235 240 

Cys Thr Asn Val Ser Thr Val Gin Cys Thr His Gly lie Arg Pro Val 
245 250 255 

Val Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Asp lie 
260 265 270 

Val lie Arg Ser Glu Asn Phe Thr Asp Asn Ala Lys Thr lie He Val 
275 280 285 

Gin Leu Asn Glu Ser Val Val He Asn Cys Thr Arg Pro Asn Asn Asn 
290 295 300 

Thr Arg Arg Arg Leu Ser He Gly Pro Gly Arg Ala Phe Tyr Ala Arg 
305 310 315 320 

Arg Asn lie He Gly Asp He Arg Gin Ala His Cys Asn He Ser Arg 
325 330 335 

Ala Lys Trp Asn Asn Thr Leu Gin Gin He Val He Lys Leu Arg Glu 
340 345 350 

Lys Phe Arg Asn Lys Thr He Ala Phe Asn Gin Ser Ser Gly Gly Asp 
355 360 365 

Pro Glu He Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr 
370 375 380 

Cys Asn Thr Ala Gin Leu Phe Asn Ser Thr Trp Asn Val Thr Gly Gly 
385 390 395 400 

Thr Asn Gly Thr Glu Gly Asn Asp He He Thr Leu Gin Cys Arg He 
405 410 415 

Lys Gin He He Asn Met Trp Gin Lys Val Gly Lys Ala Met Tyr Ala 
420 425 430 

Pro Pro He Thr Gly Gin He Arg Cys Ser Ser Asn He Thr Gly Leu 
435 440 445 

Leu Leu Thr Arg Asp Gly Gly Asn Ser Thr Glu Thr Glu Thr Glu He 
450 455 460 

Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu 
465 470 475 480 
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Tyr Lys Tyr Lys Val Val Arg lie Glu Pro lie Gly Val Ala Pro Thr 
485 490 495 

Arg Ala Lys Arg Arg Thr Val Gin Arg Glu Lys Arg. Ala Val Gly lie 
500 505 510 

Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly 
515 520 525 

Ala Ala Ser Val Thr Leu Thr Val Gin Ala Arg Leu Leu Leu Ser Gly 
530 535 540 

lie Val Gin Gin Gin Asn Asn Leu Leu Arg Ala lie Glu Ala Gin Gin 
545 550 555 560 

His Met Leu Gin Leu Thr Val Trp Gly lie Lys Gin Leu Gin Ala Arg 
565 570 575 

Val Leu Ala Leu Glu Arg Tyr Leu Arg Asp Gin Gin Leu Met Gly lie 
580 585 590 

Trp Gly Cys Ser Gly Lys Leu lie Cys Thr Thr Ser Val Pro Trp Asn 
595 600 605 

Val Ser Trp Ser Asn Lys Ser Val Asp Asp lie Trp Asn Asn Met Thr 
610 615 620 

Trp Met Glu Trp Glu Arg Glu lie Asp Asn Tyr Thr Asp Tyr lie Tyr 

625 630 635 640 

Asp Leu Leu Glu Lys Ser Gin Thr Gin Gin Glu Lys Asn Glu Lys Glu 
645 650 655 

Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asp lie 
660 665 670 

Thr Asn Trp Leu Trp Tyr lie Arg Leu Phe lie Met lie Val Gly Gly 
675 680 685 

Leu lie Gly Leu Arg lie Val Phe Ala Val Leu Ser lie Val Asn Arg 
690 695 700 

Val Arg Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr Leu Leu Pro Ala 
705 710 715 720 

Ser Arg Gly Pro Asp Arg Pro Glu Gly Thr Glu Glu Glu Gly Gly Glu 
725 730 735 

Arg Asp Arg Asp Arg Ser Gly Pro Leu Val Asn Gly Phe Leu Ala Leu 
740 745 750 

Phe Trp Val Asp Leu Arg Asn Leu Cys Leu Phe Leu Tyr His Leu Leu 
755 760 765 

Arg Asn Leu Leu Leu lie Val Thr Arg lie Val Glu Leu Leu Gly Arg 
770 775 780 

Arg Gly Trp Glu Ala Leu Lys Tyr Trp Trp Asn Leu Leu Gin Tyr Trp 
785 790 795 800 
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Ser Gin Glu Leu Lys Asn Ser Ala 
805 

lie Ala Val Ala Glu Gly Thr Asp 
820 

Ala Cys Arg Ala lie Arg Asn lie 
835 840 

Glu Arg Ala Leu Leu 
850 




Val Ser Leu Leu Asn Ala Thr Ala 
810 • 815 

Arg Val lie Lys lie Val Gin Arg 
825 830 

Pro Thr Arg lie Arg Gin Gly Leu 
845 



<210> 3 
<211> 2051 
<212> DNA 

<213> Human immunodeficiency virus type 1 
<220> 

<223> human immunodeficiency virus-1 (HIV-l) envelope 
glycoprotein gpl40 truncated version of gp 160 
from primary R5X4 isolate 89.6 



<400> 3 

atgagagtga 

gggatgttga 

gtacctgtgt 

gatacagagg 

caagaagtag 

gatcagatgc 

ttaaccccac 

aatcccacta 

tatatcacca 

gatgtagtac 

tcagtcatta 

gtcccggctg 

tgcacaaatg 

ctgctgttaa 

gacaatgcta 

cccaacaaca 

agaaacataa 

aacactttac 

tttaatcaat 

gaatttttct 

acaaatggca 

aatatgtggc 

tgttcatcaa 

gagactgaga 

tataaatata 

agaacagtgc 

ggagcagcag 

ttattgtctg 

catatgttgc 

gaaagatacc 

tgcaccactt 

aataacatga 

gacttacttg 

gataaatggg 

ttattcataa 



aggagatcag 
tgatctgtag 
ggagagaagc 
tacataatgt 
tattgggaaa 
atgaggatat 
tctgtgttac 
gtagcagctg 
caagcataag 
caatagaaaa 
cacaggcctg 
ggtttgcgat 
tcagcacagt 
atggcagtct 
aaaccataat 
atacaagaag 
taggagatat 
aacagatagt 
cctcaggagg 
actgtaatac 
ctgaaggaaa 
agaaagtagg 
atattacagg 
tcttcagacc 
aagtagtaag 
aaagagaaaa 
gaagcactat 
gtatagtgca 
aactcacagt 
taagggatca 
ctgtgccttg 
cctggatgga 
aaaaatcgca 
caagtttgtg 
t 



gaagaattgg 
tgctgcaaaa 
aaccaccact 
ttgggccaca 
tgtgacagaa 
aatcagttta 
tttaaattgc 
gggaatgatg 
aaataaggta 
tactaataat 
tccaaaggta 
gctaaagtgt 
acaatgtaca 
agcagaagaa 
agtacagcta 
aaggttatct 
aagacaagca 
tataaaatta 
ggacccagaa 
agcacaactg 
tgacataatc 
aaaagcaatg 
gctgctacta 
tggaggagga 
aattgaacca 
aagagcagtg 
gggcgcagcg 
gcagcagaac 
ctggggcatc 
acagctcatg 
gaatgttagt 
gtgggaaaga 
aacccaacaa 
gaattggttt 



cagcacttga 
gaaaaaacgt 
ctattttgtg 
catgcctgtg 
aattttaaca 
tgggatgaaa 
actaatttga 
gagaaaggag 
aagaaagaat 
actaagtata 
tcctttcagc 
aacaataaga 
catggaatta 
gacatagtaa 
aatgaatctg 
ataggaccag 
cattgtaaca 
agagaaaaat 
attgtaatgc 
tttaatagta 
acactccaat 
tatgcccctc 
acaagagatg 
gatatgaggg 
ataggagtag 
ggaataggag 
tcagtgacgc 
aatctgctga 
aagcagctcc 
ggaatttggg 
tggagtaata 
gaaattgaca 
gaaaagaatg 
gacataacaa 



gagggggcat 
gggtcacaat 
catcagatgc 
tacccacaga 
tgtggaaaaa 
gcctaaagcc 
atatcactaa 
aaataaaaaa 
atgcactttt 
ggttaataag 
caattcccat 
cattcaatgg 
ggccagtggt 
ttagatctga 
tagtaattaa 
ggagagcatt 
ttagtagagc 
ttaggaataa 
acagttttaa 
cttggaatgt 
gcagaataaa 
ccatcacagg 
gaggtaatag 
acaattggag 
cacccaccag 
ctgtgttcct 
tgacggtaca 
gggctattga 
aggcaagagt 
gttgctctgg 
aatctgtgga 
attacacaga 
aaaaagaatt 
actggctgtg 



cttgctcctt 
ctattatggg 
taaagcctat 
ccccaaccca 
taacatggta 
atgtgtaaaa 
gaatactact 
ttgctctttc 
taatagactt 
ttgtaacacc 
acattattgt 
atcaggacca 
gtcaactcaa 
aaatttcaca 
ttgtacaaga 
ttatgcaaga 
aaaatggaat 
aacaatagcc 
ttgtggaggg 
tactggaggg 
acaaattata 
acaaattaga 
tactgagact 
aagtgaatta 
ggcaaagaga 
tgggttcttg 
ggccaggcta 
ggcgcaacag 
cctggctctg 
aaaactcatt 
tgatatttgg 
ctatatatat 
attggaattg 
gtatataaga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2051 
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<210> 4 
<2X1> 667 
<212> PRT 

<213> Human immunodeficiency virus type 1 
<220> 

<223> human immunodeficiency virus- 1 (HIV-l) envelope 
glycoprotein gpl4 0 truncated version of gp 160 
from primary R5X4 isolate 89.6 

<400> 4 

Met Arg Val Lys Glu lie Arg Lys Asn Trp Gin His Leu Arg Gly Gly 
15 10 15 

lie Leu Leu Leu Gly Met Leu Met lie Cys Ser Ala Ala Lys Glu Lys 
20 25 30 

Thr Trp Val Thr lie Tyr Tyr Gly Val Pro Val Trp Arg Glu Ala Thr 
35 40 45 

Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val 
50 55 60 

His Asn Val Trp Ala Thr His Ala Cys VaL Pro Thr Asp Pro Asn Pro 
65 70 75 80 

Gin Glu Val Val Leu Gly Asn Val Thr Glu Asn Phe Asn Met Trp Lys 
85 90 95 

Asn Asn Met Val Asp Gin Met His Glu Asp lie lie Ser Leu Trp Asp 
100 105 110 

Glu Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu 
115 120 125 

Asn Cys Thr Asn Leu Asn lie Thr Lys Asn Thr Thr Asn Pro Thr Ser 
130 135 140 

Ser Ser Trp Gly Met Met Glu Lys Gly Glu lie Lys Asn Cys Ser Phe 
145 150 155 160 

Tyr lie Thr Thr Ser lie Arg Asn Lys Val Lys Lys Glu Tyr Ala Leu 
165 170 175 

Phe Asn Arg Leu Asp Val Val Pro lie Glu Asn Thr Asn Asn Thr Lys 
180 185 190 

Tyr Arg Leu lie Ser Cys Asn Thr Ser Val lie Thr Gin Ala Cys Pro 
195 200 205 

Lys Val Ser Phe Gin Pro lie Pro lie His Tyr Cys Val Pro Ala Gly 
210 215 220 

Phe Ala Met Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Ser Gly Pro 
225 230 235 240 

Cys Thr Asn Val Ser Thr Val Gin Cys Thr His Gly lie Arg Pro Val 
245 250 255 

Val Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Asp He 
260 265 270 
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Val lie Arg Ser 
275 

Gin Leu Asn Glu 
290 

Thr Arg Arg Arg 
305 

Arg Asn lie lie 



Ala Lys Trp Asn 
340 

Lys Phe Arg Asn 
355 

Pro Glu lie Val 
370 

Cys Asn Thr Ala 
385 

Thr Asn Gly Thr 



Lys Gin lie lie 
420 

Pro Pro lie Thr 
435 

Leu Leu Thr Arg 
450 

Phe Arg Pro Gly 
465 

Tyr Lys Tyr Lys 



Arg Ala Lys Arg 
500 

Gly Ala Val Phe 
515 

Ala Ala Ser Val 
530 

lie Val Gin Gin 
545 

His Met Leu Gin 



Val Leu Ala Leu 
580 




Glu Asn Phe Thr 
280 

Ser Val Val lie 
295 

Leu Ser lie Gly 
310 

Gly Asp lie Arg 
325 

Asn Thr Leu Gin 



Lys Thr lie Ala 
360 

Met His Ser Phe 
375 

Gin Leu Phe Asn 
390 

Glu Gly Asn Asp 
405 

Asn Met Trp Gin 



Gly Gin lie Arg 
440 

Asp Gly Gly Asn 
455 

Gly Gly Asp Met 
470 

Val Val Arg lie 
485 

Arg Thr Val Gin 



Leu Gly Phe Leu 
520 

Thr Leu Thr Val 
535 

Gin Asn Asn Leu 
550 

Leu Thr Val Trp 
565 

Glu Arg Tyr Leu 



Asp Asn Ala Lys 



Asn Cys Thr Arg 
300 

Pro Gly Arg Ala 
315 

Gin Ala His Cys 
330 

Gin lie Val lie 

345 

Phe Asn Gin Ser 



Asn Cys Gly Gly 
380 

Ser Thr Trp Asn 
395 

lie lie Thr Leu 
410 

Lys Val Gly Lys 
425 

Cys Ser Ser Asn 



Ser Thr Glu Thr 
460 

Arg Asp Asn Trp 
475 

Glu Pro lie Gly 
490 

Arg Glu Lys Arg 
505 

Gly Ala Ala Gly 



Gin Ala Arg Leu 
540 

Leu Arg Ala lie 
555 

Gly lie Lys Gin 
570 

Arg Asp Gin Gin 
585 




Thr lie lie Val 

285 

Pro Asn Asn Asn 



Phe Tyr Ala Arg 
320 

Asn lie Ser Arg 
335 

Lys Leu Arg Glu 
350 

Ser Gly Gly Asp 
365 

Glu Phe Phe Tyr 



Val Thr Gly Gly 
400 

Gin Cys Arg lie 
415 

Ala Met Tyr Ala 
430 

lie Thr Gly Leu 
445 

Glu Thr Glu lie 



Arg Ser Glu Leu 
480 

Val Ala Pro Thr 
495 

Ala Val Gly lie 
510 

Ser Thr Met Gly 
525 

Leu Leu Ser Gly 



Glu Ala Gin Gin 
560 

Leu Gin Ala Arg 
575 

Leu Met Gly lie 
590 
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Trp Gly Cys Ser Gly Lys Leu lie Cys Thr Thr Ser Val Pro Trp Asn 
595 600 605 



Val Ser Trp Ser Asn Lys Ser Val 
610 615 

Trp Met Glu Trp Glu Arg Glu lie 
625 630 

Asp Leu Leu Glu Lys Ser Gin Thr 
645 

Leu Leu Glu Leu Asp Lys Trp Ala 
660 



Asp Asp lie Trp Asn Asn Met Thr 
620 

Asp Asn Tyr Thr Asp Tyr lie Tyr 

635 640 

Gin Gin Glu Lys Asn Glu Lys Glu 
650 655 

Ser Leu Trp 
665 



<210> 5 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : cleavage site 
and start of gp41 in gpl60 (env 89.6) 

<400> 5 

caaagagaaa aaagagcagt gggaata 27 



<210> 6 
<211> 8 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : cleavage site 
and start of gp41 in gpl60 (env 89.6) 

<400> 6 

Arg Glu Lys Arg Ala Val Gly lie 
1 5 



<210> 7 

<211> 667 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : gpl40 -26 
(soluble, secreted protein) 

<400> 7 

Lys Glu Lys Thr Trp Val Thr lie Tyr Tyr Gly Val Pro Val Trp Arg 
15 10 15 

Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp 
20 25 30 

Thr Glu Val His Asn Gly Trp Ala Thr His Ala Cys Val Ala Thr Asp 
35 40 45 
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Pro Asn Pro Gin 
50 

Met Trp Lys Asn 
65 

Leu Leu Asp Glu 



Val Thr Leu Asn 
100 

Pro Thr Ser Ser 
115 

Cys Ser Phe Tyr 
130 

Tyr Ala Leu Phe 
145 

Asn Thr Lys Tyr 



Ala Cys Pro Lys 
180 

Pro Ala Gly Phe 
195 

Ser Gly Pro Cys 
210 

Arg Pro Val Val 
225 

Glu Asp lie Val 



He He Val Gin 
260 

Asn Asn Asn Thr 
275 

Tyr Ala Arg Arg 
290 

He Ser Arg Ala 
305 

Leu Arg Glu Lys 



Gly Gly Asp Pro 
340 

Phe Phe Tyr Cys 
355 



Glu Val Val Leu 
55 

Asn Met Val Asp 
70 

Ser Leu Lys Pro 
85 

Cys Thr Asn Leu 



Ser Leu Gly Met 
120 

He Thr Thr Ser 
135 

Asn Arg Leu Asp 
150 

Arg Leu He Ser 
165 

Val Phe Phe Gin 



Ala Met Leu Lys 
200 

Thr Asn Val Ser 
215 

Ser Thr Gin Leu 
230 

He Arg Ser Gly 
245 

Leu Asn Glu Ser 



Arg Arg Arg Leu 
280 

Asn He He Gly 
295 

Lys Leu Asn Asn 
310 

Phe Arg Asn Lys 
325 

Glu He Val Met 



Asn Thr Ala Gin 
360 



Gly Asn Val Thr 

60 

Gin Met His Glu 
75 

Cys Val Lys Leu 
90 

Asn He Thr Lys 
105 

Met Glu Lys Gly 



He Arg Asn Lys 
140 

Val Val Pro He 
155 

Cys Asn Thr Ser 
170 

Pro He Ala He 
185 

Cys Asn Asn Lys 



Thr Val Pro Cys 
220 

Leu Leu Asn Gly 
235 

Asn Phe Thr Asp 
250 

Val Val He Asn 
265 

Ser He Gly Pro 



Asp He Arg Gin 
300 

Thr Leu Gin Gin 
315 

Thr He Ala Phe 
330 

His Ser Phe Asn 
345 

Leu Phe Asn Ser 



Glu Asn Phe Asn 



Asp He He Ser 
80 

Thr Pro Leu Cys 
95 

Asn Thr Thr Asn 
110 

Glu He Lys Asn 
125 

Val Lys Lys Glu 



Glu Asn Thr Asn 
160 

Val He Thr Gin 
175 

His Tyr Cys Val 
190 

Thr Phe Asn Gly 
205 

Thr His Gly He 



Ser Leu Ala Glu 
240 

Asn Ala Lys Thr 
255 

Cys Thr Arg Pro 
270 

Gly Arg Ala Phe 
285 

Ala His Cys Asn 



He Val He Lys 
320 

Asn Gin Ser Ser 
335 

Cys Gly Gly Glu 
350 

Thr Leu Asn Val 
365 
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Thr Gly Gly Thr Asn Gly Thr Glu Glu Asn Asp lie lie Thr Leu Gin 

370 375 . 380 

Cys Arg lie Lys Gin lie lie Asn Met Trp Gin Lys Val Gly Lys Ala 
385 390 395 400 

Met Tyr Ala Pro Pro lie Thr Gly Gin lie lie Cys Ser Ser Asn He 
405 410 415 

Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Ser Thr Glu Thr Glu 
420 425 430 

Thr Glu He Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg 
435 440 445 

Ser Glu Leu Tyr Lys Tyr Lys Val Val Arg He Glu Pro He Gly Val 
450 455 460 

Ala Pro Thr Arg Ala Lys Arg Arg Thr Cys Gin Gly Gly He Asp Gly 
465 470 475 480 

He Leu Gin He Ser Gly Ser Gly Ser Gly Gly Ser Gly Gin Gly Ser 
485 490 495 

Ser Ser Gly Gly Ala Gly Gly Lys Gly Ala Val Gly He Gly Ala Val 
500 505 510 

Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala Arg Ser 
515 520 525 

Val Thr Leu Thr Val Gin Ala Arg Leu Leu Leu Ser Gly He Val Gin 
530 535 540 

Gin Gin Asn Asn Leu Leu Arg Ala He Glu Ala Gin Gin His Met Leu 
545 550 555 560 

Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin Ala Arg Val Leu Ala 
565 570 575 

Leu Glu Arg Tyr Leu Arg Asp Gin Gin Leu Met Gly He Trp Gly Cys 
580 585 590 

Ser Gly Lys Leu He Cys Thr Thr Ser Val Pro Trp Asn Val Ser Trp 
595 600 605 

Ser Asn Lys Ser Val Asp Asp He Trp Asn Asn Met Thr Trp Met Glu 
610 615 620 

Leu Glu Arg Glu He Asp Asn Tyr Thr Asp Tyr He Tyr Asp Leu Leu 
625 630 635 640 

Glu Lys Ser Gin Thr Gin Gin Glu Lys Asn Glu Lys Glu Leu Leu Glu 
645 650 655 

Leu Asp Lys Trp Ala Ser Leu Trp Lys Leu Val 
660 665 
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<210> 8 
<211> 656 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :gpl40- 15 
(soluble, secreted protein) 

<400> 8 

Lys Glu Lys Thr Trp Val Thr He Tyr Tyr Gly Val Pro Val Trp Arg 
15 10 15 

Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp 
20 25 30 

Thr Glu Val His Asn Gly Trp Ala Thr His Ala Cys Val Ala Thr Asp 
35 40 45 

Pro Asn Pro Gin Glu Val Val Leu Gly Asn Val Thr Glu Asn Phe Asn 
50 55 60 

Met Trp Lys Asn Asn Met Val Asp Gin Met His Glu Asp He He Ser 
65 70 75 80 

Leu Leu Asp Glu Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys 
85 90 95 

Val Thr Leu Asn Cys Thr Asn Leu Asn lie Thr Lys Asn Thr Thr Asn 
100 105 110 

Pro Thr Ser Ser Ser Leu Gly Met Met Glu Lys Gly Glu lie Lys Asn 
115 120 125 

Cys Ser Phe Tyr He Thr Thr Ser He Arg Asn Lys Val Lys Lys Glu 
130 135 140 

Tyr Ala Leu Phe Asn Arg Leu Asp Val Val Pro He Glu Asn Thr Asn 
145 150 155 160 

Asn Thr Lys Tyr Arg Leu He Ser Cys Asn Thr Ser Val He Thr Gin 
165 170 175 

Ala Cys Pro Lys Val Phe Phe Gin Pro He Ala He His Tyr Cys Val 
180 185 190 

Pro Ala Gly Phe Ala Met Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly 
195 200 205 

Ser Gly Pro Cys Thr Asn Val Ser Thr Val Pro Cys Thr His Gly He 
210 215 220 

Arg Pro Val Val Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu 
225 230 235 240 

Glu Asp He Val He Arg Ser Gly Asn Phe Thr Asp Asn Ala Lys Thr 
245 250 255 

He He Val Gin Leu Asn Glu Ser Val Val He Asn Cys Thr Arg Pro 
260 265 270 
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Asn Asn Asn Thr 
275 

Tyr Ala Arg Arg 
290 

lie Ser Arg Ala 
305 

Leu Arg Glu Lys 



Gly Gly Asp Pro 
340 

Phe Phe Tyr Cys 
355 

Thr Gly Gly Thr 
370 

Cys Arg lie Lys 
385 

Met Tyr Ala Pro 



Thr Gly Leu Leu 
420 

Thr Glu lie Phe 
435 

Ser Glu Leu Tyr 
450 

Ala Pro Thr Arg 
465 

lie Leu Gin lie 



Gly lie Gly Ala 
500 

Met Gly Ala Arg 
515 

Ser Gly lie Val 
530 

Gin Gin His Met 
545 

Ala Arg Val Leu 



Gly. lie Trp Gly 
580 



Arg Arg Arg Leu 
280 

Asn lie lie Gly 
295 

Lys Leu Asn Asn 
310 

Phe Arg Asn Lys 
325 

Glu lie Val Met 



Asn Thr Ala Gin 
360 

Asn Gly Thr Glu 
375 

Gin lie lie Asn 
390 

Pro lie Thr Gly 
405 

Leu Thr Arg Asp 



Arg Pro Gly Gly 
440 

Lys Tyr Lys Val 
455 

Ala Lys Arg Arg 
470 

Ser Ser Ser Gly 
485 

Val Phe Leu Gly 



Ser Val Thr Leu 
520 

Gin Gin Gin Asn 
535 

Leu Gin Leu Thr 
550 

Ala Leu Glu Arg 
565 

Cys Ser Gly Lys 



Ser lie Gly Pro 



Asp lie Arg Gin 
300 

Thr Leu Gin Gin 
315 

Thr lie Ala Phe 
330 

His Ser Phe Asn 
345 

Leu Phe Asn Ser 



Glu Asn Asp lie 
380 

Met Trp Gin Lys 
395 

Gin lie lie Cys 
410 

Gly Gly Asn Ser 
425 

Gly Asp Met Arg 



Val Arg lie Glu 
460 

Thr Cys Gin Gly 
475 

Gly Ala Gly Gly 
490 

Phe Leu Gly Ala 
505 

Thr Val Gin Ala 



Asn Leu Leu Arg 
540 

Val Trp Gly lie 
555 

Tyr Leu Arg Asp 
570 

Leu lie Cys Thr 
585 



Gly Arg Ala Phe 
285 

Ala His Cys Asn 



lie Val lie Lys 
320 

Asn Gin Ser Ser 
335 

Cys Gly Gly Glu 
350 

Thr Leu Asn Val 
365 

lie Thr Leu Gin 



Val. Gly Lys Ala 
400 

Ser Ser Asn lie 
415 

Thr Glu Thr Glu 
430 

Asp Asn Trp Arg 
445 

Pro lie Gly Val 



Gly lie Asp Gly 
480 

Lys Gly Ala Val 
495 

Ala Gly Ser Thr 
510 

Arg Leu Leu Leu 
525 

Ala He Glu Ala 



Lys Gin Leu Gin 
560 

Gin Gin Leu Met 
575 

Thr Ser Val Pro 
590 
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Trp Asn Val Ser 
595 

Met Thr Trp Met 
610 

lie Tyr Asp Leu 
625 

Lys Glu Leu Leu 



Trp Ser Asn Lys 
600 

Glu Leu Glu Arg 
615 

Leu Glu Lys Ser 
630 

Glu Leu Asp Lys 
645 



Ser Val Asp Asp 



Glu lie Asp Asn 
620 

Gin Thr Gin Gin 
635 

Trp Ala Ser Leu 
650 



lie Trp Asn Asn 
605 

Tyr Thr Asp Tyr 



Glu Lys Asn Glu 
640 

Trp Lys Leu Val 
655 



<210> 9 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : peptide linker 
<400> 9 

Gly lie Leu lie 
1 



<210> 10 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : preferred 
peptide linker 

<400> 10 

Gly Gly lie Asp Gly lie Leu Gin lie Ser Ser Ser Gly Gly Ala 
15 10 15 



<210> 11 
<211> 26 
<212> PRT . 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : preferred 
peptide linker 

<400> 11 

Gly Gly lie Asp Gly lie Leu Gin lie Ser Gly Ser Gly Ser Gly Gly 
15 10 15 

Ser Gly Gin Gly Ser Ser Ser Gly Gly Ala 
20 25 



<210> 12 
<211> 15 
<212> PRT 

<213> Artificial Secjuence 
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<220> 

<223> Description of Artificial Sequence : preferred 
peptide linker 

<400> 12 

Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
15 10 15 



<210> 13 

<211> 20 

<212> PRT 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence : preferred 
peptide linker 

<400> 13 

Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly 
15 10 15 



Gly Ser Gly Gly 
20 



<210> 14 
<211> 25 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : preferred 
peptide linker 

<400> 14 

Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly 
15 10 15 

Gly Ser Gly Gly Gly Gly Ser Gly Gly 
20 25 



<210> 15 
<211> 24 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : exemplary 
linker 

<220> 

<221> MOD_RES 
<222> (7) . . (24) 

<223> amino acids at positions 7-8, 9-10, 11-12, 13-14, 
15-16, 17-18, 19-20, 21-22, and 23-24 may be 
present or absent 
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oirser Gly Ser Gly Ser Gly Ser Gly Ser Gly Ser Gly Ser Gly Ser 
1 5 10 



Gly Ser Gly Ser Gly Ser Gly Ser 
20 



<210> 16 
<211> 48 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence : exemplary 
linker 

<220> 

<221> MOD_RES 

<222> (13).. (48) P2-24 
<223> amino acids at positions ^3"^^' ""^l, ^2 24 

25-28, 29-32, 33-36, 37-40, 41-44, and 45-48 may 

be present or absent 



S'lrcli'oiy ser Oly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser 

Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser 
20 25 

Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser 



35 40 



<210> 17 
<211> 60 
<212> PRT 

<213> Artificial Sequence 



<223> Description of Artificial Sequence : exemplary 
linker 

<220> 

<221> MOD_RES 

<222> (16).. (60) 96-30 

present or absent 
oirGlJ^Gly Gly ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly 
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Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 



20 25 



15 



Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly 
35 40 45 

Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 
50 55 60 



<210> 18 
<211> 72 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : exemplary 
linker 

<220> 

<221> MOD_RES 
<222> (19) . . (72) 

<223> amino acids at positions 19-24, 25-30, 31-36, 

37-42, 43-48, 49-54, 55-60, 61-66 and 67-72 may be 
present or absent 

<400> 18 

Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly 
1 5 10 15 

Gly Ser Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Ser Gly Gly 
20 25 30 

Gly Gly Gly Ser Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Ser 
35 40 45 

Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly 
50 55 60 

Gly Ser Gly Gly Gly Gly Gly Ser 
65 70 



<210> 19 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : exemplary 
linker 

<400> 19 

Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser 
15 10 



<210> 20 
<211> 20 

<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence : exemplary 
linker 

<400> 20 

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly 
15 10 15 

Gly Gly Gly Ser 
20 



<210> 21 
<211> 30 
<212> PRT 

<213> Artificial Sequence 

<220> 

<22 3> Description of Artificial Sequence : exemplary 
linker 

<400> 21 

Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly 
15 10 15 

Gly Ser Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Ser 
20 25 30 



<210> 22 
<211> 42 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : exemplary 
linker 

<400> 22 

Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Ser Gly Gly 
15 10 15 

Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly 
20 • 25 30 

Gly Gly Ser Gly Gly Gly Gly Gly Gly Ser 
35 40 



<210> 23 
<211> 56 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : exemplary 
linker 

<400> 23 

Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Ser 
1 5 10 15 
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Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Ser 
20 25 30 

Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Ser 
35 40 45 

Gly Gly Gly Gly Gly Gly Gly Ser 
50 55 



<210> 24 
<211> 72 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : exemplary 
linker 

<400> 24 

Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly 
15 10 15 

Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly 
20 25 30 

Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly 
35 40 45 

Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly 
50 55 60 

Gly Gly Gly Gly Gly Gly Gly Ser 
65 70 



<210> 25 
<211> 90 
<212> PRT 

<213> Artificial Secjuence 
<220> 

<223> Description of Artificial Sequence : exemplary 
linker 

<400> 25 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly 
15 10 15 

Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly 
20 25 30 

Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly 
35 40 45 

Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly 
50 55 60 
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Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser 
65 70 75 80 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser 
85 90 



<210> 26 
<211> 110 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : exemplary 
linker 

<400> 26 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly 
15 10 15 

Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly 
20 25 30 

Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly 
35 40 45 

Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly 
50 55 60 

Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly 
65 70 75 80 

Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly 
85 90 95 

Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser 
100 105 110 



<210> 27 
<211> 132 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : exemplary 
linker 

<400> 27 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly 
15 10 15 

Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly 
20 25 30 

Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser 
35 40 45 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly 
50 55 60 
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4 



Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly 
65 70 75 80 

Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser 
85 90 95 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly 
100 105 110 

Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly 
115 120 125 

Gly Gly Gly Ser 
130 



<210> 28 
<211> 156 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : exemplary 
linker 

<400> 28 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly 
15 10 15 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly 

20 , ^ 25 30 

Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly 
35 40 45 

Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly 
50 55 60 

Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly 
65 70 75 80 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly 
85 90 95 

Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly 
100 105 110 

Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly 
115 120 125 

Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly 
130 135 140 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser 
145 150 155 



<210> 29 
<211> 36 
<212> PRT 

<213> Artificial Sequence 
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• 



<220> 

<223> Description of Artificial Sequence : exemplary 
linker 

<220> 

<221> MOD_RES 
<222> (10) . . (36) 

<223> amino acids at positions 10-12, 13-15, 16-18, 

19-21, 22-24, 25-27, 28-30, 31-33 and 34-36 may be 
present or absent 

laaGly^S^r Ala Gly Ser Ala Gly Ser Ala Gly Ser Ala Gly Ser Ala 
15 10 15 

Gly ser Ala Gly Ser Ala Gly Ser Ala Gly Ser Ala Gly Ser Ala Gly 
20 25 30 

Ser Ala Gly Ser 
35 



<210> 30 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :post 
translational cleavage site 

<400> 30 
Arg Glu Lys Arg 
1 



<210> 31 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : mutated post 
translational cleavage site 

<400> 31 
Arg Glu lie Asp 
1 



<210> 32 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : short fragment 
following mutated cleavage site created by 
introduction of restriction sites 
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<400> 32 
Glu Phe lie Ser 
1 



<210> 33 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Secjuence : polypeptide 
linker end 

<400> 33 

Gly Gly Ser Gly Gly 
1 5 
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